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Abstract 

Efficiently managing laboratory test utilization requires both ensuring adequate utilization of needed tests in some patients and discouraging su- 
perfluous tests in other patients. After the difficult clinical decision is made to define the patients that do and do not need a test, a wealth of inter- 
ventions are available to the clinician and laboratorian to help guide appropriate utilization. These interventions are collectively referred to here as 
the utilization management toolbox. Experience has shown that some tools in the toolbox are weak and other are strong, and that tools are most 
effective when many are used simultaneously. While the outcomes of utilization management studies are not always as concrete as may be desired, 
what data is available in the literature indicate that strong utilization management interventions are safe and effective measures to improve patient 
health and reduce waste in an era of increasing financial pressure. 
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Introduction 

"(The) First Rule in government spending: Why 
build one when you can have two at twice the 
price?" - S.R. Hadden in Contact, 1997 (written by 
Carl Sagan). 

S.R. Hadden, the wealthy industrialist in the Carl 
Sagan drama Contact, may have been poking fun 
at stereotypical government spending practices 
when he quipped the line above, but his state- 
ment was more profound than it sounds, and it is 
especially relevant to clinical laboratory test utili- 
zation. The health care payment systems operat- 
ing today in many countries remove the clinicians 
who make utilization decisions from the financial 
consequences of their actions, with the result that 
some patients undergo at least twice the testing 
at twice the price, if not much more. 

That test overutilization exists is no longer an item 
of debate, and it is clearly an international problem 



(1). The reasons for overutilization are myriad, and 
may include as enabling factors even the technol- 
ogy we use to perform laboratory testing. There 
was once a time when it would be unthinkable 
and impractical for a hospital inpatient to get mul- 
tiple laboratory tests per day, every day (2), but in 
the present day, high frequency testing is almost 
the rule in resource-rich settings, rather than the 
exception. Instrument vendors now advertise turn- 
around times in minutes rather than hours or days 
for clinical assays, automated platforms handle 
hundreds to thousands of samples per hour, and 
data systems distribute the results of these tests 
electronically, directly to caregiver's electronic in- 
boxes. In the outpatient care setting, testing may 
not occur daily for each patient, but the barrier to 
getting testing can be incredibly low; the standard 
routine "physical" that a healthy adult might re- 
ceive at a doctor visit can be accompanied by a 
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panel of tests, perhaps only a blood count and 
electrolytes, but possibly also any one of a myriad 
of large so-called "wellness" panels offered by 
commercial laboratories that have financial incen- 
tives to drive frequent, high-cost testing. It does 
not simplify matters that additional technological 
advances, especially the genomic revolution, have 
exponentially increased the number of possible 
tests that are available. No one clinician can possi- 
bly know which novel test is appropriate in any 
given situation, and the problem compounds itself 
daily as additional genetic tests, small molecule 
mass spectrometry tests, or proteomic multivari- 
ate index assays are added to the global test menu. 
Absent any guidance, it appears that clinicians are 
apt to order whatever seems most familiar, any or 
all tests that sound like they might be appropriate, 
or whatever has been touted most heavily by am- 
bitious sales representatives. Another problem 
unique to academic settings is that one must al- 
low for the fact that physicians in training may 
need to order more than an optimal number of 
tests in order to learn how to use the results of this 
testing in the practice of medicine. 

So, what is a laboratory director to do in the face 
of this growing adversity? How do we ensure that 
that patient's interests come first, that neither in- 
patients nor outpatients get the wrong tests at all 
or the right tests too frequently, and that caregiv- 
ers are able to select tests appropriately so that 
they can most easily interpret the results? Labora- 
tory test utilization researchers, including this au- 
thor, have often summed up the literature in lec- 
tures or in private by stating that, "30% of labora- 
tory testing is likely wasteful," and now this esti- 
mate has been supported by a thorough meta- 
analysis (3). However, it is has never been clear ex- 
actly how laboratory directors or clinicians should 
go about identifying those 30% of tests, especially 
given that the wasted testing is not evenly distrib- 
uted across all patients or specific tests. What is 
needed, therefore, is a standardized approach for 
identifying malutilisation in daily clinical practice, 
and once it has been identified, a common set of 
tools should be available to fix the problem. This is 
the approach taken in this review. 



The problem of identifying inappropriate labora- 
tory test utilization is outlined below in the form of 
three "Rules," all of which are intended to provide 
the general rationale and goals of a laboratory test 
utilization management program. Afterwards, the 
laboratory test utilization management toolbox, 
comprised of a collection of evidence-based tools 
available to the laboratorian and clinician alike, will 
be described, with emphasis on those tools that 
are most appropriate for specific types of inappro- 
priate utilization. 

The three rules of laboratory test 
utilization 

Rule 1: "If you ask a stupid question, you get a 
stupid answer" 

Rule 1, otherwise known to the statistician as 
Bayes' Theorem, posits that the post-test probabil- 
ity of something being true is a product of the pri- 
or probability that the thing is true and the likeli- 
hood ratio provided by a test (4). In mathematical 
terms for laboratory testing, Bayes' Theorem can 
be expressed as: 



P(A\B) 



P(B\A) P(A) 
W) 



This can be translated to the context of laboratory 
medicine to mean that the conditional probability 
of disease (A) given the result of a test (B) (also 
called the "positive predictive value") is equal to 
the conditional probability of the result (B) given 
the disease (A) being present, multiplied by the 
probability of the disease (A) in the population 
(also called the "prevalence") and divided by the 
probability of the test giving the result (B) in all 
members of the population. 

One consequence of Rule 1 in laboratory testing is 
that ordering a test to rule in a condition when the 
prior probability is very low (i.e. a urine hCG test to 
assess pregnancy in an apparently male patient) is 
unlikely to be a fruitful endeavor, since negative 
results were already expected and positive results 
are most likely false positives. The same applies in 
the converse; one should not generally order a test 



Biochemia Medico 2014;24(2):223-34 



224 



http://dx.doi.org/10. 7 16 13/BM.201 4.025 



BairdG. 



Utilization management 



when it is extremely likely that the diagnosis in 
question is present. Rule 1 exists not just to dis- 
courage inappropriate ordering, but for the equal- 
ly important reason of saving one from the onus 
of interpreting highly unlikely test results. The true 
power of testing in the setting of low pre-test 
probability is ruling OUT a diagnosis; negative re- 
sults in these settings can be trusted, but positive 
results will always be confusing. 

Rule 1 is not always easy to apply, however, for 
three reasons. One, prior probabilities of disease 
close to 0 or 1 or quite unusual, as clinical, radio- 
logic, and historical findings do not always have 
enough evidentiary value to allow one to cease a 
workup without laboratory testing. The prior prob- 
ability of disease could also be completely un- 
known, in which case the Rule helps not at all. Two, 
many physicians are not always able to estimate 
prior probabilities with enough savvy to allow an 
accurate application of the Rule. Clinical decision 
support and education can partly address this is- 
sue, but patients are all unique, so there can be no 
absolute guide to prior probability assessment for 
all situations. Three, human beings are curious, es- 
pecially those who go into medicine. Curiosity 
manifests itself in the medical setting by explora- 
tion of unlikely possibilities, and while this may be 
an important educational activity, it should not be 
the basis for sound medical policy. 

The three challenges of Rule 1 notwithstanding, 
the Rule can still be followed effectively by 1) us- 
ing Bayesian thinking to assign at least relative, 
rather than absolute, likelihoods to various diag- 
noses to allow one to order sequential testing from 
most to least beneficial; 2) ceasing repetitive daily 
testing, or at least restricting testing to reflect our 
understanding of intraindividual biological varia- 
tion, because asking "too many questions" has the 
same effect as asking a single "stupid question"; 
and 3) always developing a plan for test interpre- 
tation PRIOR to ordering a test. This third point 
cannot be stressed enough, especially during 
medical training, and even more critically in deal- 
ing with the most curious among us. Clinicians of- 
ten construct large differential diagnoses in medi- 
cine to avoid missing the occasional rare presenta- 
tion or to avoid anchoring bias, or perhaps simply 
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to impress or intimidate their peers (5), but we 
should emphasize to our trainees that testing for 
many things simultaneously just because it's pos- 
sible or because "...I saw a patient like that once", 
is no reason to embark upon an unnecessarily 
costly diagnostic odyssey. Tests ordered out of 
pure curiosity have a funny way of presenting later 
as unexplainable positive results, and the cost of 
working up a false positive result is always larger 
than not ordering the test in the first place. 

Rule 2: "Laboratory testing is for sick people" 

If Rule 2 sounds like a restatement of Rule 1, that is 
because it is. Indeed, all of laboratory test utiliza- 
tion management stems from Bayes' theorem, al- 
though different restatements of the theorem pro- 
vide windows into distinct aspects of the utiliza- 
tion management problem. What Rule 2 focuses 
on is the well patient, and specifically, the notion 
of "Wellness Testing" (6,7). Wellness Testing com- 
monly refers to single tests or panels of tests that 
are intended to be performed on well patients, 
usually self-selecting adults, with the goal of pre- 
venting unwanted later complications of disease. 
Put in this way, Wellness Testing is in fact a misno- 
mer, in that the purpose of it is to discover that a 
patient is, or will become, unwell. In Bayesian 
terms, Wellness Testing poses a significant risk of 
diagnostic failure, or at least confusion, in the set- 
ting of low prior probabilities. In financial terms, 
however, Wellness Testing creates a highly attrac- 
tive market for laboratory vendors that earn mon- 
ey on a fee for service basis. 

Like all rules, however, Rule 2 has exceptions. Lipid 
testing, diabetes screening and newborn screen- 
ing, for example, are reasonable uses of laboratory 
testing in ostensibly healthy people, or at least 
people with prior probabilities of disease that are 
equivalent to the general population risk. Like 
many exceptions, these exceptions prove the rule. 
For lipid testing, one might argue that the stand- 
ard "healthy" patient in a population, for example 
a middle-aged male who is slightly overweight 
and exercises a tad too little, could reasonably be 
considered "sick" in terms of the risk that lipid-re- 
lated disorders pose to his long-term health. More 
importantly, however, the reason why lipid testing 
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and newborn screening are exceptions to Rule 2 is 
that there exists evidence demonstrating a benefit 
of the testing, when coupled with appropriate 
downstream therapeutic interventions. In a lipid 
panel, we test for analytes (cholesterol, lipopro- 
teins, triglycerides) that we know to be causally re- 
lated to cardiovascular disease and that we know 
respond to therapy, and we understand much of 
the risk-benefit relationship of either measuring or 
not measuring lipids (8). Likewise, despite the fact 
that inborn errors of metabolism are exceedingly 
rare, and the fact that the positive results from 
many newborn screening tests are false positives, 
the testing allows us an opportunity to confirm 
true positives and initiate therapy to avert diseases 
with extremely high morbidity and mortality that 
would otherwise be devastatingly difficult and ex- 
pensive to treat later in life. Numerous studies now 
indicate that newborn screening is cost-effective, 
as well (9-13). The WHO criteria for mass screening 
are a helpful resource for assessing the utility of 
any screening proposal (14). 

The limits to Rule 2 have exceptions. Lipid testing 
is helpful when limited to those analytes that have 
been studied in large cohorts like the Framingham 
study (15). Newer expanded lipid panels, such as 
those including genetic testing (16,17), additional 
information about the size or chromatographic 
mobility of lipoprotein particles (18,19), or addi- 
tional biochemical or proteomic biomarkers, run 
afoul of both Rule 1 and Rule 2. In some cases, 
these tests rely on evidence that is preliminary, 
limited, proprietary or otherwise insufficient to 
support widespread utilization, and in other cases 
these tests may in fact eventually find a place in 
appropriate lipid panels, but are currently waiting 
for evidence that shows us where to apply them 
most effectively. In the case of newborn screening, 
the calculus as to whether to include a test in a 
panel is actually fairly simple, as a test must be 
paired with a treatment that prevents either a dis- 
ease or the squeal of that disease. If there is noth- 
ing to do with the result of a newborn screening 
test, then it does not make sense to do the test. 
That this truism extends to all laboratory testing 
should be obvious. 
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Rule 3: "Too many good tests are the same as 
one bad test" 

Although this idea has been hinted at in Rules 1 
and 2, a separate rule should be reserved for the 
idea that repetitive tests or large panels of tests, 
even when comprised by individually reasonable 
tests, are a form of malutilisation. At the very least, 
too-frequent repetitive testing can come at odds 
with the Nyquist-Shannon theorem, a fundamen- 
tal principle of information theory (20). The math- 
ematical derivation of the Nyquist-Shannon theo- 
rem is beyond this discussion, but in words it can 
be said to define the appropriate relationship be- 
tween how often one should sample a varying sig- 
nal. In Figure 1, 4 potential situations are depicted, 
in which the same oscillating laboratory value (the 
"signal") is measured at successively longer inter- 
vals (the "sampling"). Clearly, plots A and B indicate 
oversampling, where far more laboratory tests 
were obtained relative to what was needed to de- 
fine the underlying signal, and plot D shows sig- 
nificant undersampling, leading to a misleading 
impression that the value is oscillating slower than 
it is in reality. Plot C, on the other hand, has pre- 
cisely as many sampling points as is necessary to 
describe accurately the underlying signal. In clini- 
cal terms, the Nyquist-Shannon theorem tells us 
that repeated testing to assess a change in a labo- 
ratory parameter should occur at intervals on par 
with the expected time it would take the analyte 
value to change. 

One obvious clinical manifestation of Nyquist- 
Shannon theorem in clinical laboratory is in Hae- 
moglobin A1c (HbA1c) testing. Because the longev- 
ity of red blood cells allows haemoglobin mole- 
cules to remain in circulation for several months, it 
makes no sense to assess HbA1c on a scale of days 
to weeks, but rather it should be measured on a 
scale of 1-3 months to assess changes. Despite the 
inherent logic of this conclusion, there is still sub- 
stantial variation amongst physicians in ordering 
HbA1c testing at rational intervals (21). It is also im- 
portant to understand that undersampling is a sig- 
nificant risk for HbA1c testing, as too-infrequent 
tests (i.e. a year or years between determinations 
in a poorly-controlled diabetic) can cause a physi- 
cian to miss clinically significant variation. 
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Figure 1. A hypothetical oscillating laboratory value (signal) with 
(B), 12 (C) and 31 (D) days. 



A second problem with too-frequent testing de- 
rives from the statistics of repetitive applications 
of a single test. While the sensitivity for disease de- 
tection increases when testing repetitively, the 
specificity must concomitantly decrease. It is often 
pointed out, for example, that the chances of hav- 
ing one abnormal value in a panel or series of 14 
tests whose reference ranges are defined by the 
central 95% percentile of the health population 
approaches 50%, even when the patient in ques- 
tion is totally normal (7). This counterintuitive re- 
sult is of course not entirely accurate, as it assumes 
that all laboratory tests vary independently (they 
do not), but it drives home the point that the more 
testing is done, the more disease is discovered. 
This is not always a bad thing, though, and repeti- 
tive testing placed at appropriate intervals (i.e. for 
lipid testing) can be highly beneficial to popula- 
tions. However, a disreputable laboratory vendor 
could also exploit this phenomenon to generate 
revenue from a very large panel of putative bi- 
omarker assays guaranteed to generate false posi- 
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tives that require follow-up testing. The phenom- 
enon creeps even into highly complex and expen- 
sive genetic testing panels performed at reputable 
laboratories. For example, there are numerous ge- 
netic syndromes that are known to be each caused 
by a number of different possible mutations, and 
reference laboratories will often offer a panel of 
tests that probes many or all possible mutations, 
causative or otherwise. In such cases, it is often the 
case that one specific mutation is causative in the 
majority of cases, such that testing for a single mu- 
tation first with a cheaper screening methodology 
may obviate the need for the expensive full panel 
in a majority of patients, lowering the overall cost 
of testing (22). Reflexive testing algorithms that al- 
low the providing laboratory to aid the clinician in 
defining the optimal sequential testing pathway 
are a key tool in the test utilization toolbox, and 
will be covered later. 

Evidence that repetitive or daily testing is common 
abounds in the historical literature (23-31), yet few- 
er studies have shown how repetitive testing can 

Biochemia Medica 2014;24(2):223-34 



227 



Baird G. 



Utilization management 



drive unnecessary and costly downstream activi- 
ties. In a pair of large academic medical centres, 
for example, daily ionized calcium testing had be- 
come ingrained into the culture of the house staff 
and was ordered in nearly every patient. When 
daily ionized calcium testing was reduced by ap- 
proximately 70% by introducing a reflexive testing 
strategy, doses of calcium administered by the 
pharmacy in the hospitals decreased by a very 
similar fraction, as did the diagnoses of "hypocal- 
cemia" rendered by the providers (32). While not 
enumerated in the report, it is likely that the cost 
savings of these downstream effects far out- 
stripped the laboratory-specific savings. No dis- 
cernible difference in morbidity or mortality was 
discovered after this intervention, even amongst 
diagnoses known to be associated with significant 
hypocalcemia (tetany, seizures, myocardial infarc- 
tion). In this case, a substantial burden of disease 
("hypocalcemia") was being invented by daily ion- 
ized calcium testing practices, and it disappeared 
within a month as testing practices were altered, 
raising the question of how many other fictitious 
diagnoses and unnecessary treatments inpatients 
may receive when they undergo daily routine lab- 
oratory testing. 

In another study that drew upon a large dataset 
comprising 4 million common outpatient tests 
performed in a large Canadian province (33), van 
Walraven et al. found that approximately 30% of 
testing for eight common analytes was repeated 
within a month. Potentially redundant testing, de- 
fined as testing repeated within defined time in- 
tervals, was estimated to have cost between $13.9 
and $35.9 million (Canadian) annually. As this study 
only focused on 8 common tests with low unit 
costs, it is likely that the true cost of all redundant 
testing to large health care systems is probably 
much larger, and perhaps even orders of magni- 
tude larger than what was observed in this study. 

Guidelines have been recently published describ- 
ing the "Minimum retesting interval", i.e. the mini- 
mum time before a test should be repeated(34). 
While this guideline is not comprehensive of all 
laboratory tests and all clinical situations, it pro- 
vides a substantial, evidence-based list of recom- 
mendations that covers much of the current prac- 
tice of medicine. 

Biochemia Medica 2014;24(2):223-34 



Applying the rules 

Applying the 3 rules of laboratory test utilization is 
a deceptively simple task, as identifying waste ap- 
pears simple. However, the number of people who 
have decried the third of laboratory testing that is 
waste likely far outstrips the number of people 
who have ever successfully managed laboratory 
test utilization. Laboratory test malutilisation is 
thus like the weather, in that "...everyone talks 
about it, but no one does anything about it". How 
can this be changed? 

First, in an institution desiring to curb inappropri- 
ate test utilization, there must be "buy-in" amongst 
the key stakeholders. There must be at least one 
member of the laboratory staff, a laboratory direc- 
tor, who is willing and able to expend the time 
needed to do the data analyses required to identi- 
fy the problem areas. If this laboratory director is 
in academia, as often occurs, their efforts in man- 
aging utilization should be both pursued and re- 
warded as genuine scholarly activities, with all 
work resulting in publications that allow others to 
learn from their successes and failures. If an aca- 
demic department values contributions to test uti- 
lization management below contributions to basic 
or applied sciences, then junior faculty with an in- 
terest in the field will simply not participate for fear 
of missing out on promotion, and the field will 
stagnate. Outside of laboratory staff, however, co- 
operative medical staff are needed in the remain- 
der of the hospital, especially those who are will- 
ing and able to take the time and energy required 
to participate in utilization review and quality im- 
provement activities. Second, as utilization man- 
agement is a data-intensive activity, there must be 
adequate information technology resources avail- 
able to the project leader. At a minimum, program- 
mers and/or individuals savvy in building, manag- 
ing and querying large databases are absolutely 
required to generate reports and analyses, or at 
the very least retrieve the raw data required to as- 
sess utilization. The ability to correlate laboratory 
test utilization data (orders) with test results and 
other patient data (metadata such as gender and 
age, or potentially diagnostic or outcome-related 
data) is key for assessing the opportunities and po- 
tential risks of utilization management interven- 
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tions, meaning that large relational healthcare in- 
formation databases(35), although expensive, will 
be very valuable in future utilization studies. 

The laboratory test utilization 
management toolbox 

The laboratory test utilization management tool- 
box is outlined in Table 1, with interventions that 
drive appropriate utilization sorted from strongest 
to weakest. For each tool, the intended target of 
the tool is listed, along with potential strengths 
and weaknesses, and finally a reference or exam- 
ple of how the tool can be used. The reference col- 



umn is not intended to be an exhaustive list of all 
published examples of a particular tool being put 
to use, for there are too many to include in this ta- 
ble, but rather the cited example(s) are intended 
to demonstrate an important strength or weak- 
ness of the tool. 

It is worth pointing out some tools that are not in- 
cluded in the toolbox because the available evi- 
dence suggests that they are not terribly effective. 
One of the common assumptions made about test 
utilization management is that clinicians would 
behave more rationally if they saw the cost of the 
laboratory tests they ordered. The results of inter- 
ventions that actually did this are mixed (36-38), 



Table I.The laboratory test utilization management toolbox. 



Strength Tool 



Target 



Strengths 



Weaknesses 



Example/ 
References 



Strong Ban the test 



Obsolete tests, 
"Quack" testing, 
Legitimate 
tests used in 
inappropriate 
circumstances 



This is the "Nuclear 
Option", as it ensures 
a complete cease to 
ordering 



Only useful for tests with broad Bleeding time 

consensus as to lack of utility, which and other 

is unusual. Specific individuals may "Antiquated" 

destroy consensus. tests (42). 



Laboratory 
Strong test 

formulary 



All tests, 
especially 
those with 
utilization that 
is recognized, 
after analytics, 
to be above 
what is 
expected or 
justifiable. 



A uniform policy 
across a system can 
be supported by a 
formulary, in the same 
way as a pharmacy 
formulary. Exceptions to 
formulary can be vetted 
by a committee or 
individual tasked with 
these decisions. 



Requires authority and buy-in 
from multiple factions in a medical University of 
system, and likely participation by Michigan (43). 
multiple specialties. 











Solomon 






By far the most 




meta-analysis 






effective, as the 


Logistically complex, as many 


(44), 


Strong 


Combined 

. ^ ^ Any test 
intervention 


strengths of one 


parties (the laboratory, clinicians, 


Massachusetts 


intervention often 


information services, payer systems, 


General Hospital 






complement the 


etc..) need to be involved. 


Experience (45), 






weaknesses of another. 




hematology 
testing (46). 



Strong 



Stop 

paying for 

unnecessary 

testing 



Any test 



Similarto banning 
tests, this intervention 
is effective at nearly 
ceasing testing, 
depending on who 
decides to stop paying. 



Depends on the payment system 
present in the medical system. 
Perceived as unfair, especially if 
the payer decides to stop paying 
for something without adequate 
justification. A physician may not 
know that a test will not be paid for, 
and the cost could be transferred to 
the patient. 



Trends in 
reimbursement 
shown here 
(47), example of 
medical policy 
here (40). 
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Table I.The laboratory test utilization management toolbox. 



Strength 



Tool 



Target 



Strengths 



Weaknesses 



Example/ 
References 



Strong 



Ban repetitive 
orders 



Daily inpatient 
tests 



Powerful method of 
reducing automatic 
ordering that providers 
often do not even know 
is occurring. 



Worry amongst some physicians 
that they might "miss something". 
Actual risk of missing something 
clinically important if a clinically 
indicated repetitive test is disallowed 
(i.e. coagulation tests in patients on 
anticoagulants). 



Make repeated 
orders difficult 
through 
computerized 
order entry (48), 
ban standing 
orders (49), limit 
tests to 24-hour 
period (50). 



Strong 



Privilege 
ordering 
providers 



Complex single 
tests, high 
unit cost and/ 
or difficult to 
interpret. 



Limiting testing to 
physicians who know 
how to use a test 
increases the prior 
probability in the tested 
patients, increasing 
cost effectiveness and 
diagnostic yield. 



Multiple physicians may want 
privileges, even in the absence of 
evidence that they deserve them. 



Neurogenetic 
testing diagnostic 
yield -30% for 
very rare diseases 
when expert 
providers order 
tests (51). 



Strong 



Require high 
level approval 



Complex single 
tests, high 
unit cost and/ 
or difficult to 
interpret. 



Laboratory providers 
can have more insight 
into the utility of some 
tests than generalist 
providers. 



Time consuming for laboratory staff Large Genetics 
or director, especially if there are no Sendout Testing 
laboratory housestaff to take calls. Intervention (52). 



Strong 



Change Any test in a 

computerized system with 

order entry computerized 

options ordering. 



Computerized order 
changes can be made 
far more difficult to 
subvert than paper 
order form changes. 



In the absence of a cultural change 

supporting modification of ordering Reducing testing 

practices, a complete stop to a in coronary care 

specific order may increase provider unit (53). Change 

abrasion. Unintended consequences to routine testing 

can result if one is not careful in menu (54). 
designing the intervention. 



Strong 



Offer reflexive 
testing. 



Any test where 
a cheaper 
screening test 
can be used 
before a more 
costly test. 



Can work for 
computerized or paper 
ordering. Is a form of 
decision support that 
allows physician to 
follow correct testing 
algorithm with one 
order or click. Increases 
pre-test probability 
for more costly tests, 
making them more 
interpretable. 



Requires an analyte for which a 
cheaper screening test exists. If 
using paper forms, one must realize 
that paper forms have a significant 
half-life in medical systems, and 
forms usually allow providers to 
"write in" tests that they cannot find 
on the form, thus allowing clinicians 
to subvert the intent of the reflexive 
panel. 



Reflexive ionized 
calcium (32), 
coagulation 
panels (55). 



Moderate/ Utilization 
Strong report cards 



Routine 
outpatient 
panel testing, 
daily testing on 
inpatients. 



Provides data on 
ordering to providers 
who may otherwise 
have no idea how they 
order tests, and thus 
may allow them to make 
informed decisions. 
Can be paired with 
reimbursement/financial 
penalties, or associated 
with peer feedback for 
added strength. 



No one has to read the report card, 
especially if it is not associated with 
an incentive. 



Outpatient 
report cards 
(56), intermittent 
feedback for 
physicians (57,58). 
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Table I.The laboratory test utilization management toolbox. 



Strength 



Tool 



Target 



Strengths 



Weaknesses 



Example/ 
References 



Moderate 



Computerized 
reminders/ 
decision 
support 



Selected tests 
with moderate 
volume and 
high likelihood 
of being 
misordered. 



Can provide support in 
real time to physicians 
to increase prior 
probabilities. 



"Pop-up fatigue" occurs if too many 
reminders are implemented, leading 
to provider abrasion. Providers 
will also cease to continue to read 
pop-ups after some time. 



Magnesium 
intervention (41), 
1,25 dihydroxy 
Vitamin D email 
reminder (59) 
[cited example 
also uses 
privileging]. 



Weak 



Post 

guidelines on 
paper order 
forms 



Selected tests 
with moderate 
volume and 
high likelihood 
of being 
misordered, 
but no 

computerized 

ordering 

available. 



Can provide support in 
real time to physicians 
to increase prior 
probabilities. 



As opposed to pop-ups on 
computerized forms, written 
guidelines on a paper are likely 
easier to ignore. 



Redesigning test 
requisitions and 
promulgation 
of factsheets 
(46,60). 



Weak 



Education 
alone/call for 
enhanced 
vigilance 



Required as a 
component of nearly 
all successful utilization 
management efforts. 
Interventions lacking an 
Any test educational component 

risk failure due to lack of 
buy-in from interested 
parties who do not 
understand the purpose 
of the change. 



Almost never works alone, or when 
it does, the effect wears off over time 
or completely disappears if new staff 
takes over (i.e. in a teaching hospital). 



Example showing 
effect wearing off 
after time (38), 
mixed effects 
of reminding 
physicians of test 
costs (36-38). 



however, and do not support whether or not pro- 
vision of cost information has any true effect on 
ordering. While it does make sense that some in- 
formation about cost might influence physician 
ordering behavior, the potential for a successful in- 
tervention based on this information is confound- 
ed by the fact that the true costs of laboratory 
tests are actually quite small, compared to more 
expensive items like radiologic scans, and perhaps 
more frustrating, it is often very difficult to deter- 
mine what the actual cost of a laboratory test is in 
some health care delivery systems. In the United 
States, for example, specific laboratory tests are 
not usually reimbursed by third party payers for 
inpatient stays (adults with rare diseases and some 
children are notable exceptions), and the list prices 
of tests are often wildly inflated and/or kept secret. 
Therefore, while it is isn't even clear what cost 

http://dx.doi.org/10. 1 1 61 3/BM.201 4.025 



should be shown to ordering physicians, the truth 
of the matter is that the most overutilised routine 
laboratory tests do not have high unit costs, as the 
variable cost for additional reagents needed to run 
a test on a large automated platform are miniscule 
compared to the difficult-to-assess fixed costs of 
maintaining that instrument and staffing required 
to run it. What we might want to show physicians 
at the time of order would be the potential down- 
stream costs associated with following up unnec- 
essary testing or the potential harm that could 
arise to their patients, but those costs and harms 
are difficult or impossible to tabulate. 

Another financially-based tool that has been pro- 
posed is to simply pay physicians to stop ordering 
laboratory testing. This is a difficult tool to admin- 
ister or evaluate ethically, but it has been done us- 
ing trainees. In this study (39), medical residents 
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were paid in "book money" on a scale commensu- 
rate with the size of their collective percentage re- 
ductions in laboratory and radiology test ordering. 
Notably, residents were not individually rewarded 
for their individual performance, but rather for the 
performance of a group. There were overall mod- 
est reductions in test ordering through this inter- 
vention, but they were comparable to the control 
group, and they rebounded to pre-intervention 
levels after the study ceased. Residents who par- 
ticipated in simple chart reviews to study their 
own utilization patterns in this study had more sig- 
nificant and longer-lasting reductions in test utili- 
zation. 

Whether or not tools based on provision of finan- 
cial information or cash bonuses can be consid- 
ered effective is perhaps not as important as the 
fact that third party payers or health care systems 
are already implementing interventions based on 
financial incentives (mostly negative incentives), 
and they will continue to implement more such in- 
centives. While early targets are likely to be areas 
where the motivations for overutilization are pri- 
marily financially driven, i.e. fee-for-service reim- 
bursement of tests with negligible or no clinical 
utility, US insurers are already starting to write 
medical policies that incorporate test utilization 
principles (40). 

Using the toolbox 

While banning tests entirely or codifying a labora- 
tory test formulary are listed in the toolbox as the 
strongest interventions, it must be restated that 
the most successful interventions in laboratory 
test utilization management are those that com- 
bine various tools. One can hardly expect an inter- 
vention that bans a physician's favourite test to 
succeed without an educational component aimed 
at explaining to that physician why the test is no 
longer available. The same goes for computerized 
order entry changes. While these changes seem 
the least disruptive, i.e. a single check box might 
disappear, the interconnectedness of health infor- 
mation systems is such that small changes to a lab- 
oratory's ordering interface can have unexpected 
and profound downstream changes. A computer- 
ized order entry-based intervention targeted at re- 
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ducing serum magnesium testing was found to 
paradoxically increase magnesium testing, for ex- 
ample, by inadvertently encouraging physicians to 
order serum magnesium together with serum cal- 
cium and phosphorus (41). The paradoxical effect 
found in this study was observed through con- 
stant utilization monitoring, allowing the investi- 
gators to institute a new, successful intervention, 
highlighting the fact that monitoring is a neces- 
sary part of all utilization management tools. With- 
out ongoing monitoring of utilization, it is impos- 
sible to assess the effectiveness of any utilization 
management intervention. As was indicated in 
Rule 3 above, test utilization should be monitored 
on a timescale similar to how fast it is expected to 
change; it is not enough to measure utilization at 
the end of a project or the end of the year. 

A final critical factor required for using the utiliza- 
tion toolbox is the development of a rapport be- 
tween the laboratory and the rest of the hospital. 
Clinicians who order tests are rightfully wary of 
mysterious and draconian utilization management 
interventions forced upon them by unknown enti- 
ties. Making things worse, in this author's experi- 
ence, clinicians are often surprised to learn that 
there are doctoral-level scientists and clinicians 
employed by the laboratory who are concerned 
with optimal utilization of laboratory tests. As edu- 
cation is a key tool in the toolbox, we in the labora- 
tory must therefore embrace our roles as educa- 
tors to ensure that the ideas we have are under- 
stood and appreciated by clinicians and patients 
alike. When done appropriately, utilization man- 
agement interventions can lead to improved mo- 
rale not just amongst laboratory staff, but amongst 
all parties involved in the process. Without all three 
entities (laboratory, clinician, patient) having ade- 
quate information and interest in participation, 
optimal utilization of laboratory testing cannot be 
achieved. 
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